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(54) System land method fffccollaborative ranking of search results employing user and group 
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(57) : X ^ys^tearf for ranking search' .results obtained 
from an Information retrieval system 1 1 ncludeis a search 
pre-proc&>sor.(30), ajsearch engihej (20) ;arid a search 
post-processor j(40). J The s^rchj d re-processor (30) j 
determines the iqntekt of the searjsh query by compar- 
ing the terms rn ; trje search query with a predetermined 
user context profile.. Preferably, the context profile is a 
user profile 'or a community profile^which includes a set 



: of terms which havfe been rated by the user, community, 
or a recommender system. The search engine gener- 
ates a search result comprising at least one item 
obtained from the information retrieval system. The 
search post-processor (40) ranks each item returned in 
the search result in accordance with the context of the 
search query. 
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Description ; • { 

[0001] This invention relates generally to informa- 
tion retrieval' systerris 'arid more particulaVlyj to a system 
and method' 'of icollabprafively. ranking results returned 
from a search engirie;usihg user and group profiles. 
[0002] The WprJdjWiile Web (the |wpb" or "WWW") 
is an architectural framework for accejsjSlhg documents '* 
(or web pages) stored on a worldwide Jietwork of distrib- 
uted servers called 'trie Ihternet: Documents stored on 
the Internet are' defined as web pages^The architectural 
framework of the web integrates webj pages storeo* on 
the Internet using jlijkp. Web 1 pages consist of elements 
that may iricliide ; ieit, ' brapihics/ images; video and 
audio. A web . pag£, : whjich . points to the, location lof 
another web pafcje, |s said to be linkedjto that other web 
page. Links that are! set forth in a, web page usually take 
the form of a text frajgfoerjt or an image. A user follows a > 
link by selecting it ' j " t 
[0003] With; the advent of networking technology 
and the World Wide Webj, the ability : to. access informa : 
tion from external- screes has preatly increased. Vari- * 
ous search engine^, enable ; a user to submit a' query, 
which returns a' coltectioni of items or documents j A well- 
crafted query may jreturn a manageable :set of docu : 
ments, typicallyifrprn 30 tj> 50 documents] A less; narrow 
query may return oyer 1fJoo ; doqumerts} 'An 6vdrly;narr 
row query mayreturrf nbi docurrtents i in: which case no - 
ranking is required). Various techniq les are available 
for assisting the user in; refining or narrowing his/her 
search query. .Howeyer,! once the search result has 
been properly narrdWed,j a significant 1 problem in infor- . 
mation retrieval is rjoW to rank the rejsults returned by 
the searcrii : engine Jor I the combination of j search 
engines. '' >, ■ ■ ■* ' j i ;' . ; 

[0004] For ([indhVidual search engines, there ( are 
many different tecbhiiquejs for ranking rjesults, ranging 
from counting the frequency of the afcpjearance of the 
various search terms Jn'the search qufery to calculating 
vector similarities: betwepn a search! term vector and 
each returned document vector. In a networked environ- 
ment such as the World Wide : Web 1 ,' meta-searchers 
access different a^b* often : heterogeneous | search 
engines arid;fpSce|tiJie ackfrtional difficulty of combining 
the rankingj. inforrrjation: returned by the; individual 
engines. Meta-searbher js a Web information retrieval 
system aimed at searching answers td ajuse^s query in 
the heterogeneous jnforrnation pVovidArs distribute pver , 
the Web. When a taetaisearcher receives responses 
(usually in the fomvof HTML files) frojn the information 
providers, a special component of ja meta-searcher 
called a wrapper, process the responses to answer the 
original query. Sina^j many search e*V<jines, ; including . 
meta-searchers, hitjter th^ mechanism used for docu- 
ment ranking, theiprpblem of merging search results; is 
compounded. A problem common to both' individual 
search engiries; ; ano[ meti-search engjn^siislith^t these 
approaches ignbre.ior knowing nothing ptbout, the user 



conducting the search, or the user's context for conduct- 
\ ing the search. 
[0005] Relevance; feedback is one approach; that 
elicits information abjfrut the user and his/her search 

5 context. Relevance feedback techniques re-rank the 
search results by using user feedback to recalculate the 
relative importance of key words in the query. While 
powerful from a technical point of view, relevance feed- 
back approaches suffer from user interface issues. The 

10 relevance information required is often difficult to elicit 
successfully from users during the search process. U.S. 
' Patent, No. 4,996,642, to Hey, System and Method for 
Recommending Items, describes a system for providing 
recommendations to users based on others items previ- 

15 oiisly sampled by the] luser and the availability of the 
items. 

: [0006] ; Knowledge Pump, a Xerox system, provides 
community-based recommendations by initially allowing 
users to identify their interests and "experts" in the 
20 areas of those interests. Knowledge Pump is then able 
to "push" relevant information to the users based on 
those preferences. This is accomplished by monitoring 
network traffic to create profiles of the users, including 
; their interests and "communities of practice," thus ref in- 
25 ing the community i specifications. However, monitored 
or automatically created profiles for establishing context 
; nrjay not accurately reflect the user's context at all times. 
[0007] There is a need for a system and method of 
ranking search results which does not require user 
30 solicited relevance information. There is also a need for 
a ; system of ranking search results which takes into 
' account a predetermined user context profile. There is 
' also a heed for a system and method of ranking search 
results which; ranks results based on a user selected 
35 context. There is also a need for a system and method 
of ranking search results which takes into account a 
group or community to' which the user belongs. There is 
• a further heed for a system and method of creating a 
user and community profile for ranking search results. 
40 [0008] A system for ranking search results obtained 
from an information retrieval system, according to the 
« invention, includes a search pre-processor, a search 
' engine, and a! search fcost-processor. The search pre- 
- processor, responsive, to a search query, determines 
45 the context of the search query by comparing the terms 
in the search query with a predetermined user context 
profile. The user's context profile may include, for exam- 
' pie, the user's identity, the community or set of commu- 
nities applicable to the search, and the point of view the 
so user wishes to adopt (e.g., that of a domain expert) for 
the search. Preferably, the context profile is a user pro- 
, file or a community profile, which includes a hierarchical 
> set of. terms that have been rated by the user or commu- 
.' nfty. Also, a recommender system may be used to gen- 
55 erate the user, or community context profile. 

[0009] The search engine, responsive to the search 
. query, generates a search result comprising at least one 
item obtained from the information retrieval system. (If 
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no items are Turned, such as jWhen the search is 
overly narrow, no tanking is required.) Generally, a great 
number of Hems will) be generate^,, which the search 
engine will proyicje In its own predetermined jforrh "of 
hierarchical valuation. The search ppstj-processor, 
responsive to:a| hori J empty search result, ranks, each 
item returned in the search result in accordance with the 
context of the search query. The ranked results may 
then be prcjvide^or displayed in jariy. normal fashion, 
such as on : a computer display preprinted put; If more 
than one search engine is used,|each search engine 
returns its own list of search results. The post-search 
processor then rafiks all items returned, regardless of 
search ; engine, jnj accordance; with Jthe context of* the 
search query. 1 I, : \ « , 

[0010] ; ■■ A method of ranking search results obtained 
from an Information; retrieval system,' according to the 
invention ■inpludpsprpviding.a predetermined user con- 
text profile, gendratintjj a search qu|ery, arid ajpplying the 
context profile to; ithe search query to "generate d context 
of the search qqery. A search is trjen performed based 
on the search' queryj which includes at least one item 
obtained from the information retrieval system. The 
search results ataj thfen ranged in 'accordance with the 
context of the sajarch jquer^. *' \ , , ; 1 j 
[0011] i The system and method j according to the 
invention:coup1e,s f a predetermined user context profile 
(e.g., user i| profiling,! community profiling j or recom- 
mender profiling) with the search brocess. ^y coupling 
context profiling !wiith tihe search process, search results 
are no longer'ah isolated event, fcjut are ranked within 
the context of a particular user or community or recom- 
mender system ppint;of view. Defending on the user's 
context for trie s6archj a different predetermined context 
profile may be 'selected, th^s^custpmiiihg the ranking of 
each particular Search. ! /' j 

[0012] j, The userand community profiles are built by 
analyzing, document j collections put! together by the 
users and trje Communities tp which the u'seijs belong, If 
any of the retrieved search results ate ^considered rele- 
vant to the user or the community^ they can be used to 
tune or modify th$ particular; user lor community profile 
by re-weighiing the profile terms. : 
[0013] Useriapd dommiAiity ,prbfi|ing is particularly 
useful in the invfsrition. First/ the post-processor uses a 
particular cianteicf prqfile (either the' user's profile, the 
community's profile or another user's profile ^ such as a 
known expert inja domain outside the user's! expertise) 
to rank the results of a search quejrys jPrefer$bly, a user 
profile is build! fijom a; user Selecting: a; particular docu- 
ment collection and ranking br ratinig the various terms 
within the ; document collection! The user profile 
becomes, the document collection 1 with rating informa- 
tion attached to eachidocument. A r user can have rnore 
than one user cpntext: profile/ or use another {user's con- 
text profile .in orcjerto (an k the search; results most expe- 
ditiously; according tp a particular point < of view. The 
ranked search results can be used to update the user 



profile based on new submissions or documents pro- 
duced in the search and ranked using the user's con- 
text. This approach is similar to relevance feedback. 
1 ; [0014] : Similarly, a community (i.e., a group of users 
5 j having similar interests) profile can be built by categoriz- 
ing the documents, in a document collection into the 
; ; communities (when jsuch a construct exists in the docu- 
ment collection) and then ranking the various docu- 
\ ments 'according to the particular users belonging to the 
10 community. A user's ability to rank documents within the 
community will vary according to his/her levels of exper- 
tise. Various methods to approximate a user's level of 
expertise within a community can be used (e.g., by 
■ 5 agreement, by statistics, etc.). 
15) [0015] [The 'system of the invention provides an 
' architecture that allows these methods to work together 
in support of corfimunity-based relevance feedback. 
The system and rpethod of the invention provide the 
; ability ;to rank results returned across multiple search 
20 engines and the ability to take into account the user's 
context through use of user, community or expert user 
profiles. 

Figure 1 illustrates an example of a distributed 
25 \ ' operating ; environment for performing the present 

. . » invention; ; ( 

i ■ j ' ■ ( - i i. 

Figure 2 is a block diagram of a system for ranking 
. search results obtained from an information 
30] retrieval system in accordance with a predeter- 
mined context profile. 

[0016] Referring now to the figures, Figure 1 illus- 
, trates an example of a distributed operating environ- 
as* ment for -performing the present invention., In the 
, , • distributed operating environment illustrated in Figure 1 , 
!; client computers 1^02, request searches, communicate 
with other client computers and retrieve documents (i.e., 
i web pages), stored on servers 104 for either viewing, 
40 ' storing, or printing. The client computers 102 are cou- 
pled to the servers 104 through internet 106. Some cli- 
ent computers 102, which are located on an Intranet 
110, communicate indirectly with servers 104 located 
on the Internet 106 through a proxy server 1 12. The cli- 
45 ! ent computers 102 ; may consist of either workstations 
; 114 or laptops 1,16. Alternatively, the client computers 
, i : 1 02 may request searches, communicate with other cli- 
ent computers and: retrieve documents (i.e., web pages) 
; stored on Intranet servers such as proxy server 1 12 for 
so ; either viewing, storing, or printing. 

[0017] Referring to Figure 2, a system for ranking 
search results obtained from an information retrieval 
system according to a predetermined user context pro- 
j file is generally shown therein and referred to by 
55 numeral 10. System 10 includes search pre-processor 
| 30, which! takes a query 102 from a user 100 and 
... applies a predetermined user context profile to deter- 
mine the context' of 'the search query. The user context 
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profile may be a gsprf profile generate^ by user profiler 
50 or a community; profile generated; by a community 
profiler 60. Results from' the search query, which gener- 
ally include a plurality pjF hierarchically-ranked; search 
results based on the query, are returned by the jvarious 
search engine^ 26, on meta-pearcji ^engine ■ ' 80 by 
searching an inforrrjation! retrievarsysterrv(s,uch as the 
Internet). These search results are thlen ranked by the 
search pdst-processoV 40 and providpd to the user in 
the form of ranked documents 124. \ , 
[0018] Community ; profiler .60 : ranks community 
document colliectibrj 70 in accordance with evaluations 
or rankings deterrrioied b'y the members! of community 
130. In some peases 1 ,: community manager 120 mjay 
determine from! time to time whethera particular user 
may join or continue to b4 apartiof the| community. User 
profiler 50 ranks a selected document collection (Which ' 
may also be the community document collection 70) in 
accordance with evaluations or ratings^ by user 100. 
[0019] The 1 system may be extended to support 
community-based nilavartce feedback. 1 Iri addition to the 
search pre;-processbr| one 6r more : sea,ncti engines or 
meta-searpri' engines; an*d search post-processor, the 
extended systern ma^ iriclude one or more; document 
collections withiasspciate'd user, community/groMp, and 
rating attributes, a.;user iprofilef, a community' profiler 
and a community manager. Additionally,! the extended \ 
system may jnclude wrappers that allow the profilers to 
extract document content (or document reference, such 
as its URL); user, -community : and. rating information 
from the document collections and wrappers that allow 
the search pre-processpr to' submit queries -to* the 
search engine; and &e sdarch post-processor to extract 
the results. :. , j ' ■ j ( ' !•',!' i ; 

[0020] Tbe.jdocumenjt collection; may t>e;orje (or a 
combination) cif .several different Wpes: oloduments 
residing in a document management system iir a file j 
system or documents referenced by^ a recommender 
system. In eacrvcase;th$ document collection provides 
a specific methodology ! for associating content with 
users and potentially ijvithj communities' of users. In each , 
case, the document 'collection 'provides; the- basis for 
establishing ith^ Usirco(rtext profile, 'in'jhat the docu- 
ment collection anctiuser ratings establish Ithel environ- 
ment or the iptqiTela^ed conditions under which the user 
desires to rank search rebu Its.' , j . y i ;| I . 
[0021] Preferred . dqcument' : collections jinclude , 
those provided .by cc-nrirnunity reconlmender systems 
which attach user ideptrfication, community categoriza- 
tions and userjratirj^s to the documents. Using docu- 
ment collections; : j generated by community 
recommender systerris allows ;use t - of the most spphisti- 
cated of the; -user 4nd community profiling techniques 
described below. iPteferably, a community-based rele- 
vance feedback! system jncludes a recommender sys- 
tem as one of trje document collections, 6riprpfefab|y as 
the principal one. 1 j.j , . ■ ; ■ ; j 

[0022] An important aspect of the system for rank- 
: ; 1 i' 
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ing Is the document collection used to generate the con- 
text profile. The document collection may include an 
application program interface (API) for allowing the pro- 
filers to query for: alt documents submitted and/or 
reviewed by a user (wfio may be associated with a par- 
ticular community). -If such an API is not provided, then 
a wrapper suitable for extracting the information may be 
used. A wrapper is a tool used by a meta-searcher that 
scans the HTML files returned by the search engine, 
drops the markup instructions and extracts the informa- 
tion related to the "query. Then the wrapper takes the 
answers from the different providers, puts them in a new 
format and generates jan HTML file that can be viewed 
by theiuser. The API W the wrapper generates "meta- 
data* which is used by the profilers to construct and to 
incrementally update the user and community profiles 
from the set of documents relevant to the user and in the 
context of the community. In the case of standard docu- 
ment collections, (such as file systems or document 
management systems), it is generally assumed that any 
document filed or stored by the user is relevant to the 
u$er. In the case of ;a recommender system, it is gener- 
ally assumed that any .document submitted or reviewed 
by a user with* an average or higher rating is relevant. 
[0023] The search ^engine may include an API for 
submitting a search and retrieving results. If not, a suit- 
able wrapper may be used. The problem of query trans- 
lation across multiple, heterogeneous search engines 
and databases and the extraction of the search results 
is well known. Thus, any commercially available transla- 
tion and extraction product may be used. However, it 
should be noted that search engines do not necessarily 
cover the documents jn the system's document collec- 
tions, although overlap is always possible. 
[0024] The search pre-processor determines the 
context of the User's search. The search pre-processor 
applies a predetermined context profile to search query. 
For example, the context profile may include the user's 
identity, the community or set of communities appropri- 
ate to the search, and the point of view the user wishes 
to adopt for the search, if any (such as that of another 
user or a domain expert). The context profile can be 
retrieved explicitly by asking the user to identify him/her- 
self and by asking: the user to select the appropriate 
community or communities and/or point of view. Context 
can also be determined (deduced) automatically by 
matching the query with a query memory associated 
with a community (if selected) or the collection of users 
using the system. 

[0025] A preferred context profile is that of the user. 
The user profile is created or generated by the user pro- 
filer, which constructs . a term-weight vector for each 
u$er which is ^extracted from the set of documents sub- 
mitted and/or .reviewed into each of the document col- 
lections to which the user participates. Matching a user 
across several different document collections is not 
always simple. One method of accomplishing this is to 
ask the user to provide his/her identifier (and password, 
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if needed) for eAch document collection. Jf a. user with- 
holds. sppie of this information, then; his/her profile will 
be less 'complete fhan for other users Who do not with- 
hold this' information \ However, this is i hot always the 
case as a user may choose only io] provide access to 
particular document collections deerhed appropriate by 
the user. This rirobiem only occurs' is more than one 
document collection is used in the System, which is pref- 
erably the :one>- provided by the, community recom- 
mender system.!; ; j , ; 

[0026] " The te^m-weight vector or : user profile is 
calculated ;in a ^tandprd way, although: various linguis- 
tic-based enhancements are possible: as noted below. 
For a user.'jo.ime ivejtfor includes thfe set of terms {f,} 
with their weights, wj",, -P u = {f,.< i*^};: lf| the term- 
weight vector tejat least. calculated In. part from docu- 
ments that have^beeji evaluated implicitly or explicitly 
rated) In some wa^ by the user, thejn the ratings given to 
the documents dan be used to biasjthe term-weight vec- 
tor ", J; ; - ; ' 
[0027] 11 The usjer, profiler may also Calculate <the pro- 
file of the, user ih theicoritext of a Community or a spe- 
cific domain or domains. In this case'.rthe user profiler 
would take into accoijint only those documents submit- 
ted or reviewed by thi user and classjf jed;(ejther by the 
user or automatically) into the. domain. An. added diffi- 
culty in this ca&e, is' matching communitios/domaihs 
across document {collections. Again, if there is only a 
single document Collection, the rocommender system, 
this difficulty disappears. The usef profiler provides an 
API that returns 'af tertm-weight vector in response to a 
user identification; arid possibly a community/domain 
identified j [ V ' ' j J ;, ' ; 

[0028] \ Similarly, the community profile P? is created 
or generated by a ccjmrriunityf profiler.; The 'community 
profiler constructs a term-weight yector for leach, com- 
munity, which is^l extracted from the ; set of documents 
classified into a cqmmunity within each of the document 
collections. The term-weight vector for the community is 
determined" in a, way! analogous to . that employed for 
users. The comrrijjnity vector contain^ the set of terms 
{tj} with, their wsjjqhts: w°\\ P c F { t it ,w° j} The weight 
of each term is ;calcUlated from the weights' w", of the 
individual community: members (users j: : Since the con- 
tributions: of ;thie mempers are frequently much different 
from one another," the community prbfile bar) be biased . 
to weigh more heavily! the contributionipf "experts" in ttie ' 
community (special users). Experts are those commu- 
nity members; whose recommendations are most fre- 
quently followed; ;by :the whole community. Formally, 
each member.t/iri the community 'is assigned a weight 
a u . Experts have the highest and for the whole com- 
munity: i: i ! ' !: \ ■ I • i ■ . 1 ; 



The individual a„ must be re-normalized whenever a 
user enters or leaves the community. Then, the weight 
. of term f/ in the corlnmunity profile is evaluated as: 



5 
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w i = L a u w i - 



where w"/ is the weight of f/ in the profile of user u. 
10 Beyond the community and personal (user) profile, the 
' user can request the profile of the community expert (s), 
| which contains weight w w / for each profile term f,. The 
I ' community profiler;! provides an API that returns a term- 
weight vector inTesj)onse to a community identifier. 
is [0029] ; When registering a new user u to commu- 
i nity, the initial user profile P u - (t *v,) can be one of 
1 the following options: the community profile (t h w{)\ a list 
of user defined keywords t } (the weights W; are equal or 
induced from community profile); or empty. Any docu- 
20 ment reviewed or submitted by the user changes her/his 
I profile as follows. - 

i [0030] j If (a hew) document D is submitted, all terms 
' ' and their associated weights are extracted from the doc- 
ument, k top-weighted terms are then selected such 
25 that a document profile is created: D = {f jt tf j) , for /'= 
I 1 ,...,/c, where k is a system/application-dependent con- 
: stant. Each document D reviewed and in the document 
collection has its own profile D = (t' it w*j) . Otherwise, 
for a reviewed document, its profile D = (t' jt w'j) is 
30 retrieved from the repository where it is stored along 
; with the document. 

i [0031] | The current user profile vector P u =(t jt w f ) 
' and new document profile D=(f' /f W,) , are used as fol- 
lows to up'date the' user profile. For each term in set {// 
35 u tj], we evaluate'. 



50 



new ,4 . ' 

Wj =yxw / +(1-y)xw / , 



40. where y is a "profile conservativeness" constant, 0< y 
: <1 .The closer y to 1 , the slower the profile changes with 
; new submissions.; Practical values of y are ih range 
[0.5,0.95] and canjdepend on the number of user sub- 
missions (over last rn days). , 
45 [0032] Only, trie k top-weighted terms t"*" are 
; chosen for the new user profile P tJ and their weights get 
; : normalized: 



. [0033] j When creating a community, the community 
i administrator (which can be a human or a software pro- 
gram) can use lor the initial community, profile 
55 P c -{t jt w jt ) one of the following options: process sam- 
ple document(s) relevant to the community and extract 
terms and weights as with a user submission described 
above; use a list of community keywords f/ given by 
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administrator, or leave 1 it empty. Any ddcuhieht reviewed 
or submitted- by a^cbmjnunily member! changes the 
member profile: Beyond the 'member profiles; tt]e com- 
munity profiler maintains values of contribution aj, for ! 
each community member u. j ' 

[0034] For user.u in the community, its contribution 
a u can be evaluated: as i 



where r u is a riumtpteV: 6lj documents s jbmitteid by the 
user (over the last m days) and f u is thfe number of com- 
munity users that fojlowed those- recommendations, v is 
a customization coteticient; it may favor a user with 
numerous, but moderate,; recommertdatibns rather than 
a user with one but a -popular recommendation. 'The 
user with th&. higher value oty is cail4d the community 
expert and his/her prdfile lean be used as an expert pro- 
file P 6 *? by other users for re-ranking the ; searchlresults. 
Values a u are; kept normalized suj:h;jthat ; 2ia u; =1. 
Optionally, experts clan bfe chosen 1 or assigned^ by com- 
munity members without statistical evaluation. 1 
[0035] For all members of the community, their pro- 
files P u = (tj", Wj") and their contributions a u are 
used for updating .the[community profile.; For eabh term 
f/ in 



its weight is 



i mem- 



The community; prbWe cajn keep all termsifroi^h i ; 
bens' profiles or.ohlyjthe jctop-weightddierms;;ih either 
case, their weights are kept normalized: iwf=\ . ; 
[0036] The updalp of the commijnity profile P° is 
performed preferably \jvhen a minimally required; 
number of user prtifjle changes have ; occurred; The 
community profile- ubdatd is processor time-consuming; 
thus it is preferably |b=update.the profile off-line. : 
[0037] in a preferred embodiment! the, system 
according to the;inventiofj) incorporates a [recommender 
system. In addjtion to the storing of oiourtieht profiles, 
user profiles community profiles' and fexftert profiles, a 
recommender system would also inplude or provide 
tools for profile retrieval and profile updates. 
[0038] Matching! community definitions across doc T 
ument collections aWd maintaining, a coherent list of 



communities and users; participating in those communi- 
ties is frequently difficult If a community recommender 
, system exists within the system, then its list of commu- 
nities is a likely candidate for adoption for the commu- 
5 nity relevance feedback system. Alternatively, an 
administrator of the system could be responsible for 
matching groups or collections in other document col- 
: lections with the community list The task of constructing 
'. such a list from scratch : would fall to the administrator in 
io the absence of such a list It is also possible to create 
automatic methods ; ; of performing the matching, 
although this may possjbly reduce the accuracy of the 
community profiles. It is possible that the way in which 
^. the cortirriunity list is constructed and matched across 
is collections that the end result will be that the community 
profile is entirely determined from the data in the com- 
munity recommender system, if such exists, or other 
; document collection with a notion of the community. 
[0039]i' it should be noted that a community profile is 
20 not required iri order to practice the system and method 
of the invention. In some instances, it may be appropri- 
ate to take into account only the user's context, in the 
absence of the community, although such a system will 
• be more difficult to accept and add new users. 
25 [0040] A method of ranking search results obtained 
from an information retrieval system using a predeter- 
mined user context profile would include the following 
; steps. Before a user starts formulating queries, because 
' of the greater, benefits available from a recommender 
30 system, the user is assumed to have registered with a 
recommenden systjenrt.! This permits the system to 
upload the user profile, jcommunity profile and/or expert' 
profiles chosen by the users for search result ranking 
! and profile re-weighting. 
35 [0041] Once a user has formulated a query, the 
search pre-processor takes the query and processes 
the keywords in the query to a query profile 
: P q - {tf q ,w f q ) . This profile is used by the search post- 
r processor later. 
40 [0042] Wrjen the c|uery is submitted to a search 
engine and the search result returns, the user sees the 
documents determined! from the search query, listed or 
ranked in accordance with the algorithm provided by the 
' particular search engine(s), if any. Since this ranking will 
45 likely not rank the results within the context desired by 
the user, the user can request re-ranking. Alternatively, 
the re-ranking by the search post-processor can be 
automatic. 

[0043] The search post-processor takes as input 
so the list of search results returned by the search engines. 
It has two preferred ways to evaluate the relative rank of 
the documents. One is by matching the document or its 
pointer (e.g., its URL) to one already existing in one of 
the document collections. For example, if the document 
55 has been rated by a community recommender system 
connected to the system within the appropriate commu- 
nity context, then this rating is given a high weight in 
determining the. relative rank of the result. 
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[0044] The secortd preferred means' of evaluating 
the relative ranking of documents js by using the profile 
term-weight 1 vectors ais a source of relevance feedback. 
Depending orijhe context (i.e., lUser, community or 
expert), the appropriate profile is requested.!,' > . 
[0045] ! ft jEadfe document in the search result is down- 
loaded, iie;,: the full contents of each document is down- 
loaded for;term:extrajction; The extraction is similar to 
that for submittal documents when a> user context pro- 
file is being created; from a ^ocunrienVcdl lection as 
described above! the 1 search post-processor returns a 
document profile f d \= (t d /f w d ,j for each document 
d that contains k> top-jwelghted terms. Generally, a doc- 
ument in a sedrdhjresponse is hot considered as impor- 
tant as a . su^rtiittdd document in j the i j document 
collection, jthus the number /f 1 of terms chosen for the 
search returned;dpcumentiprofile may be less than that 
for a submitted document.: If a document in jthe search 
result is alreadyjn the document colliedtiqh,' its profile is 
not extracted frdm the document tJuf is up-loaded from 
the document storage. 1 „ : ■ i * 



[0046] :. Once;a' profile P° = (r 



is obtained 



for each document diin the search response, the rele- 
vance (or ranking^ of document ct with respect to, the 
chosen user/community/expert profile! is obtained by the 
formula: ..*!'!■« 1 1 ! ; f : : 



relevance [d) 



10 



15 



20 ' 



25' 



30i 



where wf m ! : are* term weights in the selected user, ' 
community,, or jexpert profile. Trjen; 1 documents are 
sorted based pn their calculated rank values and pre- 
sented to user; '**' ; ! " ' ■ I ' 
[0047] :*;Since,the profile-based o'ccurrjent re-ranking 
takes someiirtie; (needed forfthe deferments jdowni-load- , 
ing, term extraction ahd rank calculation); ,thb user may 
request for re-ranKing, switch to cjther activity (or con- 
tinue search) an^ return back to re|-ranked results later. 
Alternatively, the; user; may request persistent queries, 
when user queri^ are executed byline. - : : \- ] 1 
[0048] . Note jth[at th e te rm weights • w^/ are : evai u ated 
from the^docunrjentd content only Since a response 
can actually , be a;bri$f description ofj aboriginal docu- 
ment, the term weightjs n^/'in this case may fc)e quite dif- 
ferent fromithe terrn weights, vf 3 ] i(i {he case when, the 
full document is -available. Although ,ttie relevance rank- • 
ing is then biased by the profile (through the term wf mf 
), there is 1 astandard (tradeoff between the length of the 
response documents ;and the qualify of the ranking: the 
longer the document, jthe npore.prec&eithe ranking (and 
the longer rt tates to; perform the* ranking). 'After each 
documertt in trie; search result is ranked, the; results are 
displayed in ranked order to the user. j '". \; 



35 ■ 
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[0049] Either kind of user search (with or without re- 
ranking) can lead to finding documents relevant to the 
community. Such documents submitted by the user 
after the search may be used to change or modify the 
user profile and, consequently the community profile, as 
discussed iabove. 

[0050] , Not all s$arch engines are necessarily exter- 
nal to the search. ;An internal document collection can 
also be searched with a query. The post-processing in 
this case is simpler. Indeed, the term frequency vector 
for a document can be extracted a prior, stored along 
with the document in the collection and reused in the 
relevance ranking each time the document fits a query, 
thus reducing the ranking time. 
[0051] ; If a user, provides a positive feedback to the 
search result or documents retrieved over the process, 
the search results can be included in the document col- 
lection like any othefr recommendation. Additionally, the 
search results canjbe used to modify the user profile by 
re-weighting term weights. In such a case, the query 
terms and/or most frequent terms in the response form 
a set {Rel} of relevant terms. Using this set, the stand- 
ard Rocchio formula for relevance feedback can be 
used. . The main difference between the approach 
described herein and standard relevance feedback is 
that the approach- of the invention does not take into 
account npn-relev&nt terms since this approach does 
not have a reliable-way of extracting this kind of informa- 
tion from both; document collections and search results. 
As a result of re-weighting, the relevant terms from [Rel] 
have their weights increased in the user profile. 
[0052] The search post-processor generally 
requires textual content in order to evaluate the compar- 
ative relevance of the returned items in the context of a 
given user, or community profile. This means download- 
ing' either an abstract (If available) or the entire docu- 
ment. Th ! entire : process could become quite lengthy 
especially if the; nurjnber of documents returned by. the 
query is large: A first step of prefiltering may be neces- 
sary in order to prune the list to a manageable number. 
However, white the time cost is high, it should be 
remembered that the time cost to the user of evaluating 
the returned documents him/herself is even higher. In 
many cases, users may be willing to turn collaborative 
ranking on, return to other work at hand, and wait until 
an alert indicating the collaborative ranking process has 
terminated. As a further incentive to use the collabora- 
tive ranking feature; the items downloaded in the proc- 
ess can be cached locally, so that subsequent browsing 
by the- user will be much less time-consuming. 
[0053] Document content comes in many formats. 
In order to operate across as many formats as possible, 
the search post-processor will need to be able to con- 
nect to other modules that transform content format into 
the. search format Preferably, all formats will be trans- 
formed into ASGIf format. Some documents may fall 
outside the system's ability to rank them. The system 
will need to distinguish these from those documents that 
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are ranked in a.way: meaningful to the user. ,.- ; 
[0054] It will beiapp^ciatedithat tjie presenjt inven- 
tion may be readily jMptehriehted in software usipg 'soft- ; 
ware development environments that! provide portable 
source code that can "be jused on a variety of hardware 
platforms. Alternatively, the disclosed; system may be 
implemented partial^ or fully jn, hardware- using stand- ■ 
ard logic circuits. Whether software or«hardware;is used 
to implement' tKe system varies depending on the speed 
and efficiency requirements of the system and' also the 
particular function and the particular software, or hard- 
ware systems ! : and; the particular nrjicrofrrope ssor or 
microcomputer systems being; ujilized. < / | , 
[0055] The Invention has been described wrfh refer^ 
ence to a particular! embodiment. . Modifications and 
alterations will occur tjo others upon reading and under- 
standing this specification takentojgether with the draw- : 
ings. The embodiments; are ;but exampies| aind Various 
alternatives.^bdffiiatiofiis, variations 'or improvements 
may be made by those skilled in the art frpmlthis teach- 
ing which are .intended 'to be e ncompasseci | (ay i the. fol- 
lowing claims. |: ; !. ■ { I i [ . : : ;' ! ! ! 



Claims 



1. A system for ranging search results obtained from 

an information retrieval system, comprising : * 

.: ■'»!:' , ; ; ; a ■ ' ■ ! , ' I •■ . 

i ' it * ' ; ' » , ■ i I ! ! ' 

a search prte-propessor, responsive to a, search 
query, tor 1 determining a context 1 of the search 
query; jri accordance with a predetermined user 
context, prof He;' j i j / " <; I \. ' j 
a search' engine, responsive ;to the ! search , 
query, for generating asearoK result compris- 
ing at least o^e'item obtainedifrom the informa- 
tion retrieval system; and 
a search , post-processor, responsive . to the. 
search re. s[j If, for ranking the Itejn jin; accord- 
ance yvith thd cpptext !of the ;search query. - 

2. The system of claim 1, Wherein the usericontext 
profile comprises on^ onrrjore of a U6er prof lie com- 
prising a set 6f:tQmn6 rated ; by the user, a; commu- 
nity profile , comprising a set ot terms 1 rated by : 
members of the;cjommunity, and a* relevance profile 
comprising! a set of terms generated by a recom- 
mender system:; ; ; j , , 

3. The system ot|;clairh 1 . or clairo 2, wherein the 
search' engine jcpmprises a plurality : of individual 
search engines, each search engine j responsive to 
the search,:quejy ( generating a search result com; 
prising at jeasl one jtem obtainedSfrbhri the infotrria- 
tion retrieval system;! and . X 1 1 ' : ' i ' 
wherein.; the pb^t-processor, rejsponsive to the 
search results, ranks the items in/accordance with 
the context of the; search, query 
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The system of any of the preceding ciaims, wherein 
the search : pre-processor determines a profile of 
the query P q - (f, q ,w,. q ) in accordance with the 
predetermined user profile wherein tf* comprise the 
query terms having term weight, wf 1 . 



The system of any of the preceding claims, further 
comprising a'cbntext profiler for generating a con- 
text profile and a document collection comprising a 
set of documents;; the context profiler comprising a 
user profiler for constructing a term-weight vector 
for the user, the tdrm-weight vector being extracted 
from each document in the document collection. 

The system of any of claims 1 to 4, further compris- 
ing a context profiler for generating a context profile 
and a document collection comprising a set of doc- 
uments, the context profiler comprising a commu- 
nity profiler for constructing a term-weight vector for 
the community, -the term-weight vector being 
extracted from each document in the document col- 
lection and the : cdmmuntty comprising a plurality of 
users u. 

The system of claim 6, wherein the community pro- 
file P c = (tj,w c f ) comprises the set of terms {f/} 
with their weights w 0 ,- for each of the individual 
users u in the community. 

The system of claim 7, wherein each member u in 
the community is 'assigned a weight a a and for the 
whole community: 



and wherein the ; weight of term t f in the community 
profile is evaluated as: 



9. The system of claim 4, wherein the search post- 
i processor evaluates each item d in the search 
1 result and generates a document profile 
so 1 P d = (f% w d ,.) 'jjfpr each item d, where f*- is the 
profile term and the weight of each term. : 

i 10. The system of claim 9, wherein the predetermined 
user context profile comprises a community profile, 
55 P c ={tf, w c ;) where 
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15. A method of creating a user context profile for use 
in ranking search results obtained from an informa- 
tion retrieval system, comprising: 



and yPj is ;trie* weight; of t { in a profile of userirln the ' ' s [ 
community.!: m ■ ' f . ; - '. i 

11. The system, of claim 10, wherein the, search post- 
processor; determines tr\e rejeiratice 6f eacfi Item d 
in the searcf) result in accordance With: ! ' , ■ io\ 



: relevance ^ d =? 



I™ - j. , * ; 1 '■' * . i ■ 



where; yfij is the jweight of the term f irija query q, 
and wfyis.trie vector; lenjjth "pro^ejtf :: in the con- 



text profile and evaluated as 



: i I 
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>2S' 



providing a document collection comprising a 
plurality oi[ relevant documents; 
assigning^ rating to each of the documents in 
the docuhibnt collection to generate a docu-. 
ment profile P d -(t d it w d s ) for each docu- 
ment in the collection; 

constructing a weight-term vector, whereinthe 
weight-term vector includes a portion of the set 
of terms {f^} with their weights w^- to form a 
user profile- P" = (f,., iv,) . 16. The method of 
claim; 1 5, furthercomprising: 
for each, document d returned in response to a 
search .query q, generating a document profile, 



evaluating the search 
accordance with: 



document rank in 



ro W? 



relevance „ M - -!■ — 



12. A metKodicrf .ranking:search results obtained from 
an information; retrieval system, comprising: 

providing. a predetermined user context profile; 
generating aisearch : query; ; i ■ l 
applying the context profile to the search query 
to.gerierate -a user context of the search query; 
generating a search result in response to the 
searc[i :qLery, comprising at; least, one; item: 

; pbtaineji f rdrh the informaiion retrieval system; 

1 rahking : tye item in accordance; with; me context 
ofthe search query. I ' ; 



30\ 



'35' 



where wfyis the weight of term t in the 
response ^document d, wf* 0 ? is the weight of 
term t in the user context profile, ^ is the 
weight of the term f in the query q, and »v d is 
the vector length "projected" on the context 
profile and evaluated as 



prising:;! 



1 



generaljng a! plurality. of search'; results, each 
search repult being obtairjed from the informa- 
tion retrieval system; 

ranking". ttye search results in accordance with 
the context ;of the.search^qliiery, and updating 
,the ptedeterhnin'eci user context' profile using 
. highly ; Hanked items returned from ^the search 
queiy r 1 ; j \' \ 



13. The method j of claim 12, whenath, the user context t 40 
prof ile comprisesione orinore pf a user profile corn- ' 
prising a set : qf terms rated byjthe user from a user 
provided document collection^ a community profile 
comprising a get iof terms rated by. members of the , 
community, ■ a|id a relevance ,profj!e ; comprisirig a ; ; 45 
setof temjisjigerierated by a repommender system. t 

• S ; = ■ ■ ; : ' * j j ■ ; • 1 ' 1 ■ 

14. The method; of cjaim ; 12 or claim 13, further com- 
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updating the user context profile using the 
highest ranked item returned from the search 
query. 
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